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Abstract 

Consider Least Squares Monte Carlo (LSM) algorithm, which is 
proposed by Longstaff and Schwartz (2001) for pricing American style 
securities. This algorithm is based on the projection of the value of 
continuation onto a certain set of basis functions via the least squares 
problem. We analyze the stability of the algorithm when the number 
of exercise dates increases and prove that, if the underlying process 
for the stock price is continuous, then the regression problem is ill- 
conditioned for small values of the time parameter. 

Keywords: option pricing, optimal stopping, American option, Least 
Squares Monte Carlo, Monte Carlo methods, stability, Ill-Conditioning. 

1 Introduction 

The problem of pricing American and Bermudan style options is of funda- 
mental importance in option pricing theory. In the continuous time set- 
tings McKean (1965) proposed an algorithm for pricing an American put 
option with an infinite maturity via Ordinary Differential Equations and 
Partial Differential Equations. Further developments of this technique and 
applications to other American style securities are considered in Peskir and 
Shiryaev (2006). Another approach is pricing via Monte-Carlo simulations 
that is described by Glasserman (2004). One of the most difficult tasks in 
the theory of pricing of American and Bermudan options is the determina- 
tion of an optimal stopping rule and the valuing of the option under such 
a rule. Longstaff and Schwartz (2001) proposed an algorithm for pricing 
American and Bermudan style options via Monte Carlo simulations, Least 
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Squares Monte Carlo or LSM. This technique is especially useful when we 
deal with multi-factor processes. In this case the methods based on bino- 
mial, trinomial trees, or partial differential equations become slow and thus 
inefficient due to the high dimensionality of the problem. 

As in the majority of the numerical algorithms the starting point of LSM 
for American options is a substitution of the continuous time interval with 
a discrete set of exercise dates. Practically, by doing this we substitute the 
American option with a Bermudan one. Then for each exercise time (except 
the first and the last one) we project the value of continuation onto a set 
of basis functions via linear regression. Clement, Lamberton and Protter 
(2002) investigated the convergence of the algorithm with the growth of the 
number of the basis functions and the Monte Carlo simulations. Under fairly 
general conditions they proved the almost sure convergence of the complete 
algorithm. Also, they obtained the rate of convergence when the number of 
Monte Carlo simulations increases and showed that the normalized error of 
the algorithm is asymptotically Gaussian. However, they considered a fixed 
partition of the time interval and thus, essentially, they discussed the prop- 
erties of the Bermudan, not American option. Glasserman and Yu (2004) 
investigated the behavior of LSM with the simultaneous grows of the number 
of the basis functions and the number of the Monte-Carlo simulations and 
estimated the rate of convergence in some more specific settings. Moreno 
and Navas (2001) considered the LSM for different basis functions, namely, 
power series, Laguerre, Legendre, Chebyshev polynomials, and deduced that 
the algorithm converges at least for American Put options when the under- 
lying problem has a small number of factors. Stentoft (2004) obtained the 
rate of convergence of the algorithm in the two period multidimensional 
case. 

In the present work we consider the stability of LSM algorithm, when 
the number of exercise dates increases in such a way that there are exercise 
dates close to an initial time, which we assume to be equal to zero without 
loss of generality. We prove that the algorithm is unstable when the time 
parameter is close to zero, because the underlying regression problem is 
ill-conditioned. 

The remainder of this work is organized as follows. In Section [2l we 
describe the algorithm. In Section [3l we prove the main result, which is 
formulated in Proposition [H instability of the algorithm for the small values 
of the time parameter due to the ill-conditioning of the corresponding matrix 
in the regression problem. In addition we present the results of the numerical 
simulations that illustrate the assertions of Proposition [TJ In Section [U we 
give the concluding remarks. 
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2 Description of the Algorithm 



Assume that the stock price process X = {^t}tg[o,T] is given by the strong 
soluton to the following Stochastic Differential Equation: 

dXt = fitdt + atdWt, , . 

Xo = xo. ^ ' 

Here xq € is a constant, W = {W^tj^gpT] i^ ^ Brownian Motion (possi- 
bly multidimensional) on a filtered probability space ^ri, J-", {J^t}t£[oT] ' 
where P is the risk-neutral probability measure, fi and a are the progres- 
sively measurable functionals, such that the strong solution to equation ([T|) 
exists and unique on the time interval [0, T], see Karatzas and Shreve [6], 
Chapter 5, for the discussion of this topic. The time horizon is T, which we 
assume to be a finite constant. Usually equations of such a form are used to 
describe the evolution of the stock prices in practice. Let the payoff of an 
American option at the time of exercise r is given by if (Xr) , where (p is the 
corresponding payoff function. Then the value of the option is determined 
by the formula: 

uiXo) = supE\e--fo''^'^''ipiXr) . 

Here r = {rt}jgjQj,j is an interest rate process, which we assume to be 
deterministic for simplicity; A is the set of the stopping times with respect 
to the filtration {^t}t^[oT] ■ 

To approximate J^(Xo) numerically let us conduct N Monte-Carlo sim- 
ulations of the process X. First, we need to divide the time interval [0, T] 
into M subintervals [tm, tm+i] of the length At = j^, where tm — -^'rn, 
m = 1, M. Thus at every moment tm we obtain realizations of the 
process X]}^, n = 1, A. Second, for each simulation we compute the value 
of the option at time tM = T (under the assumption that the option was 
not exercised before T): 

Cl^^^iXl^), n = l, A. 

Discounting these values we get a cash fiow vector 

where d = exp{— f^^ ^ Tudu) is the discount factor. 

To obtain the value of the option at tM-i-, Ctj^,i_^ (under the assumption 
the option was not exercised before tM-i), we chose a hypothesis of linear 
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regression and project the cash flow vector bM-i, for example, on a constant, 
Xtj^j_-^, and X^^^__^. According to [5] this is one of the simplest yet success- 
ful regression models. According to [8] a good alternative choice of basis 
functions can be Hermite, Laguerre, Legendre, or Chebyshev polynomials. 
If we use and as the basis, the estimate of the conditional 

expectation becomes 

E[CiM-il-^tM-i] = « + /3^tM-i +7^L_,, (2) 

where a, 13, 7 are some constants. Then along every path we compare 
values of immediate exercise, ip{X^^__^), with values of continuation that 
are obtained by substitution of Xl^^__^ into equation ([2]). The bigger of two 
gives C^^_^, n = 1, N. If value of immediate exercise is bigger we set 

Similarly we obtain E [Ct^ j-^tm] foi" each m E {M — 1, . . . , 1} via solving 
linear regression problems A{tm)x{tm) = b{tm), where components of the 
matrix A(tm) depend on the regression hypothesis and the outcomes of the 
simulations, x{tm) is an unknown vector of coefficients, and the vector 
is given by equation 

where d = exp ^— //j^"^^ ruduj is the discount factor. 

Finally we discount the cash flow up to the moment of time to = and 
compare it with the value of immediate exercise at time to, Lp{Xtg). The 
bigger is the value of the option. 

3 Ill-Conditioning for small t 

Let V = {tm = '^''^ ■ = 1, . . . , M — l| be a partition of the interval 
[T/M,T{M — 1)/M]. In order to compute the estimates of the value of the 
option at each t £ V (under the assumption that it was not exercised before) 
we solve the linear regression problem 

Ait)xit) = b{t), (4) 

where x{t) is an unknown vector of coefficients, vector b{t) is determined 
by equation and the matrix A{t) depends on the regression hypothesis 
and the outcome of the Monte Carlo simulations. Assume that we have 
chosen K continuous functions /i, • • • , fx as the hypothesis. Examples of 
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such functions are power series, Laguerre, Legendre, Hermite polynomials, 
etc. In this case A{t) has the following form 



( 





) 



(5) 



) 



where is the number of the simulations. We show that, if the underlying 
process X is almost surely continuous, then for small t the problem is 
ill-conditioned. For a matrix A in the I2 norm the condition number is 
defined as 



where (Tmaxi^) ^-nd (^mini^) are maximal and minimal singular values of 
the matrix A respectively. A problem with a low condition number is called 
well-conditioned, while a problem with a high condition number is called 
ill-conditioned. 

Usually, problem ^ is solved via one of the following methods: House- 
holder triangularization, Gram-Schmidt orthogonalization, singular value 
decomposition, or normal equations. Let k be the condition number of 
matrix A. If (^A^A) exists then the exact solution to the Least-Squares 
Problem is given by the vector x = (j4-^j4) ^ A^b, i.e. it is a product of the 
left-inverse of the matrix A and the vector b. One can see that the solution to 
the problem obtained via normal equations, is governed by k^, whereas 
the solution obtained via SVD, Householder or Gram-Schmidt is governed 
by K. Consequently, normal equations are the least stable with respect to 
the grows of the condition number. Nevertheless, the analytical solution to 
the Least-Squares Problem is defined in terms of the normal equations. 

Let K{t) denote the condition number of the matrix A(t) for t S [0,T]. 

We show below that limK(t) = 00 P-almost surely. Therefore, no mat- 

40 

ter what algorithm one uses (Householder, Gram-Schmidt, SVD or normal 
equations), for small values of time the underlying regression problem is 
ill-conditioned, and thus the algorithm is unstable. 

We prove ill-conditioning for an arbitrary number of basis functions in 
the following proposition. In addition, we illustrate the phenomenon with 
the results of the numerical simulations in the case when regression is done 
on three basis functions 1, x, and x^. 

Proposition 1 Assume that the process X is given by equation f^, 
k = 1, K, are continuous functions, such that Ylk=i fk (-^o) 



k{A) ^ \\A\\ ■ \\A 



-1 




(6) 
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Figure 1: ln{K{A)) as a function of ln{t), t € (0.009,1], X - lognormal 
process, n = 0, a = 0.15, 30000 paths, 100 time steps, Milstein discretization 
scheme. 



for each t G [0, T] the matrix A{t) is defined by equation ^ and K{t) is the 
condition number of A{t). Then 



hm K.(t) 



oo 



1. 



(7) 



Proof. Consider equation ([5]). It follows from equation ([T]) that all rows of 
j4(0) are identical. Consequently the rank of ^(0) equals to 1. Let us look 
at the following matrix: 



/ fKxo) 

/2(Xo)/i(Xo) 



/l(Xo)/2(Xo) 



fliXo)fKiXo) \ 
f2iXo)fK{Xo) 



\ fKiXo)fliXo) fK{Xo)f2{Xo) 



fUXo 



) I 



(8) 
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Figure 2: In {k{A)) as a function of ln{t), t € (0.009, 1], X - normal process, 
/i = 0, fj = 0.03, 30000 paths, 100 time steps, Euler discretization scheme. 



The matrix (^A^A) {0)/N has two eigenvalues: Ylk=i fki-^o) 

(with multiplicity 1), is the second one (with multiplicity K — 1). Thus 



^4(0) has singular values y A^X^^i /|(^o) and 0, consequently k(0) = oo. 

Note that the matrix (^A^A^ (t) is real and symmetric, thus the eigen- 
values of {A^A) {t) are real for all t's. Since eigenvalues are continuous 
functions of the components of the matrix and the components in turn are 
a.s. continuous processes, we deduce that for small t's the first eigenvalue is 
in the neighborhood of X^fcLi /fc (^o)-^? which is greater then zero by the as- 
sumption of proposition, whereas all other eigenvalues are is in the neighbor- 



hood of O. The conclusion, 



lim K(t) = oo 



1 follows from continuity 



of the underlying process X and the basis functions fkS,l<k<K. □ 

Intuitively Proposition [1] shows that for small values of t the condition 
number is large. 



7 



4 Concluding Remarks 

We proved that for a continuous underlying stock price process, LSM algo- 
rithm for pricing American options is unstable when time parameter is small. 
An interesting question is to obtain an exact bound on applicability of this 
algorithm. A possible criterion of applicability is the condition number of 
matrix A(t), K,{t). For example, if K{t) exceeds a certain value, one can treat 
(jl]) as a rank deficient least squares problem (see [3] for details), or switch to 
another method: backward induction or the method introduced by McKean 
[7^ of option pricing via Ordinary Differential Equations or Partial Differ- 
ential Equations considered on a smaller domain. For certain problems it 
is possible to obtain desired accuracy using relatively small number of time 
intervals, then one does not have to solve the regression problem for small 
t's, and consequently the algorithm can be stable. Also, if the underlying 
process X is discontinuous with high probability the algorithm can be stable 
even for small values of the time parameter. 
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